Mining Indirect Associations in Web Data

نویسندگان

  • Pang-Ning Tan
  • Vipin Kumar
چکیده

ABSTRACT Analysis of asso iation is an important Web mining te hnique be ause it an provide useful insight into the navigational behavior of Web users. E-tailers an use this information to develop strategi marketing plans and to re-stru ture their Web site in order to enhan e the browsing experien e of their ustomers . Previous work on mining Web asso iations has fo used primarily on nding frequent a ess patterns in the data. These patterns an be generated by Web users who share similar information goals or by those with varying interests. Sin e Web asso iation patterns onsider only o-o urren es in data, it is diÆ ult to identify patterns generated by one group of Web users but not by the others. Another drawba k of the existing approa h is that it does not adequately address the impa t of Web site stru ture on the support of a Web page. As a result, the majority of Web asso iation patterns dis overed using onventional te hniques ontain the home page or other referen e pages that have multiple outgoing links. In this study, we apply a new mining te hnique alled indire t asso iation to Web usage data. This novel te hnique is apable of ombining the various asso iation patterns into a more ompa t stru ture. It an also apture both positive and negative orrelations that exist in the data. We demonstrate the appli ability of this te hnique on Web data from both ommer ial and resear h institutions. Our analysis shows very promising results, espe ially in terms of identifying Web users with distin t interests.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Indirect Positive and Negative Association Rules in Web Usage Mining

One of the purposes of Web usage mining is to find out interesting user association rules from web server logs. It has become vital for personalization, effective web site management, business and support services, creating adaptive web sites, and so on. In the web domain, items correspond to pages and transactions to user sessions. Indirect associations, type of infrequent pattern provide usef...

متن کامل

Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining

Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consumi...

متن کامل

Efficient Mining of Indirect Associations Using HI-Mine

Discovering association rules is one of the important tasks in data mining. While most of the existing algorithms are developed for efficient mining of frequent patterns, it has been noted recently that some of the infrequent patterns, such as indirect associations, provide useful insight into the data. In this paper, we propose an efficient algorithm, called HI-mine, based on a new data struct...

متن کامل

Mining Indirect Association between Itemsets

Discovering association rules is one of the important tasks in data mining. While most of the existing algorithms are developed for efficient mining of frequent patterns, it has been noted recently that some of the infrequent patterns, such as negative associations and indirect associations, provide useful insight into the data. Existing indirect association mining algorithms mine indirect asso...

متن کامل

Exploring Biomolecular Literature with EVEX: Connecting Genes through Events, Homology, and Indirect Associations

Technological advancements in the field of genetics have led not only to an abundance of experimental data, but also caused an exponential increase of the number of published biomolecular studies. Text mining is widely accepted as a promising technique to help researchers in the life sciences deal with the amount of available literature. This paper presents a freely available web application bu...

متن کامل

Mining Indirect Association Rules for Web Recommendation

Classical association rules, here called “direct”, reflect relationships existing between items that relatively often co-occur in common transactions. In the web domain, items correspond to pages and transactions to user sessions. The main idea of the new approach presented is to discover indirect associations existing between pages that rarely occur together but there are other, “third” pages,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001